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Abstract 

We propose a natural process for allocating n balls into n bins that are organized as 
the vertices of an undirected graph G. Each ball first chooses a vertex u in G uniformly at 
random. Then the ball performs a local search in G starting from u until it reaches a vertex 
with local minimum load, where the ball is finally placed on. In our main result, we prove 
that this process yields a maximum load of only 0(log log n) on expander graphs. In addition, 

we show that for d-dimcnsional grids the maximum load is 6 ( ( lo '° ^ - ) d+1 ^ . Finally, for 
almost regular graphs with minimum degree f2(logn), we prove that the maximum load is 
constant and also reveal a fundamental difference between random and arbitrary tie-breaking 
rules. 

1 Introduction 

It is well known that if each of n balls is placed sequentially into one of n bins chosen inde- 
pendently and uniformly at random, then the highest loaded bin is likely to contain €)( i giog n ) 
balls. We call this process the 1-choice process. Alternatively, in the d-choice process, each ball 
is allowed to choose d bins independently and uniformly at random and is placed in the least 
loaded among the d bins. It was shown by Azar et al. 0] and Karp et al. fll| that the maximum 
load reduces drastically to 6 ( Xo f}°^, - ) in the d-choice process. The constants omitted in the O 
are known I3| and can be improved by considering more careful tie-breaking rules as shown by 
Vocking fl8^ T Berenbrink et al. extended these results to the case where the number of balls 
is larger than the number of bins. 

In some applications, it is important to allow each ball to choose bins in a correlated way. 
For example, such correlations occur naturally in distributed systems, where the bins represent 
processors that are interconnected as a graph and the balls represent tasks that need to be 
assigned to processors. From a pratical point of view, letting each task choose d independent 
random bins may be undesirable, since the cost of accessing two bins which are far away in the 
graph may be higher than accessing two bins which are nearby. Furthermore, in some contexts, 
tasks are actually created by the processors, which are then able to forward tasks to other 
processors to achieve a more balanced load distribution. In such settings, allocating balls close 
to the processor that created it is certainly very desirable as it reduces the costs of probing the 
load of a processor and allocating the task. 

We propose a very natural and simple process for allocating balls into bins that are inter- 
connected as a graph. We refer to this process as local search allocation. At each step, a ball is 
"born" in a bin chosen independently and uniformly at random, which we call the birthplace of 
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the ball. Then, starting from its birthplace, the ball performs a local search in the graph, where 
in each step the ball moves to the adjacent bin with the smallest load, provided that the load 
is strictly smaller than the load of the bin the ball is currently in. Unless otherwise stated, we 
assume that ties are broken uniformly at random. The local search ends when the ball visits 
the first vertex that is a local minimum, which is a vertex for which no neighbor has a smaller 
load. After that, the next ball is born and is allocated according to the procedure described 
above, and this process is repeated. See Figure [Q for an illustration. 
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Figure 1: Illustration of the local search allocation. Black circles represent the vertices 1-6 arranged as a path, and the 
yellow circles represent the balls of the process (the most recently allocated ball is marked red). Figure (a) shows the 
configuration after placing i — 1 balls. As shown in Figure (b), ball i born at vertex 4 has two choices in the first step of 
the local search (vertices 3 or 5) and is finally allocated to vertex 2. Figure (c) and (d) shows the placement of ball i + 1 
and i + 2. 

In this paper we analyze the asymptotic behavior of the maximum load obtained when 
n balls are allocated to n bins as n — > oo. The main question is whether the local search 
allocation ensures a small load amount like the (i-choice process with d 2. Our main result 
gives a positive answer for the important case where the n balls are allocated to n bins which 
are organized as the vertices of an expander graph. We show, in this case, that the maximum 
load is G (log log n), which has the same order of magnitude as of the (i-choice process (refer to 
Theorem 13.11 for a more general statement of Theorem 11.11 b elow) . 

Theorem 1.1 (Expander graphs). Let G be any expander graph with constant maximum degree. 
Then, as n — )• oo, the maximum load after n balls are allocated is O(loglogn) with probability 
l-o(l). 

Remark 1.2. Theorem 11.11 holds also when ties are not broken uniformly at random but by 
means of a fixed permutation. In this tie-breaking procedure, for each vertex v £ V, we associate 
an arbitrary permutation £„ of the neighbors of v. Then, whenever a ball is currently at v during 
its local search, the ball breaks ties by using the order in the permutation £„. With this tie- 
breaking procedure, the local search allocation is a deterministic function of the birthplaces of 
the balls and the permutations {Cv}vev- 

Before discussing and comparing our main result with existing results, we proceed to state 
our other results. Recall that, unless otherwise stated, we assume that in the local search 
allocation ties are broken uniformly at random. An important instance for applications is when 
the bins are organized as a ring or a grid. The theorem below establishes the maximum load in 
this case up to constant factors. 

Theorem 1.3 (Grid graphs). Let G be any d-dimensional grid graph, where d 1 is any integer 

i - 

d+l 



constant. Then, as n — > oo, the maximum load after n balls are allocated is 
with probability 1 — o(l). 



log n 
log log n 



In comparison to Theorem 11.11 the above theorem shows that the maximum load can be 
quite high on graphs with small expansion. Besides the expansion, it is also conceivable that 
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a large degree ensures a small maximum load. The next theorem confirms this intuition by 
showing that a constant maximum load is obtained for any almost regular graph with minimum 
degree O(logn), where a graph is almost regular if the ratio between the minimum and maximum 
degrees is a constant. This class of graphs includes hypercubes and Erdos-Renyi random graphs 
with average degree (1 + e) logn, for any e > 0. 

Theorem 1.4 (Dense graphs). Let G be any almost regular graph with minimum degree ^(log n). 
Then, as n — )• oo, the maximum load after n balls are allocated is 0(1) with probability 1 — o(l). 

In the next two theorems, we do not restrict the local search allocation to break ties uniformly 
at random. In particular, we show that deviating from the usual random tie breaking rule can 
dramatically increase the maximum load. Here, as in Remark II. 2 \ we define a tie-breaking rule 
as a collection of variables {6u}t>eV> where each £ v is a permutation of the neighbors of v. 

Theorem 1.5 (Impact of tie-breaking rules). For any d = w(l) as n — >■ oo, there is a d-regular 
graph and a choice of {£ v }vev f or which the maximum load after n balls are allocated is at least 
fi( min {d 1//4 , ^^j}) probability 1 — o(l). 

To highlight the difference between random tie breaking and arbitrary tie breaking, Theo- 
rem [T3] establishes that there exists a d-regular graph G with d = B(logn) and a tie-breaking 
rule for which the maximum load in G is fi((log n) 1//4 ). On the other hand, by Theorem 11.41 we 
have that breaking ties uniformly at random leads to a constant maximum load in G. 

Our final result establishes some lower bounds on the maximum load. These lower bounds 
hold for arbitrary tie-breaking rules. 

Theorem 1.6 (Lower bounds). For any graph with maximum degree A, the maximum load after 
n balls are allocated is at least £1 ( l °iJ°^ n j with probability 1 — o(l) as n — >■ oo. Furthermore, 
for any integer 2 ^ d ^ W=1L ) there is a d-regular graph for which the maximum load after n 



balls are allocated is at least n \J j with probability 1 - o(l) as « ^ oo. 

The combination of Theorem 11.51 and the second statement of Theorem 11.61 shows that the 
two conditions concerning the degree and the random tie breaking in Theorem 11.41 are not only 
sufficient but also necessary for obtaining a constant maximum load. 

While in the d-choice process a ball only probes the load of up to d bins, the local search 
allocation may probe the load of w(l) bins for some balls. However, the number of bins whose 
load a given ball can probe is bounded above by A times the load of the birthplace of the ball, 
where A is the maximum degree of G. Therefore, the expected number of probed bins per ball 
is at most A and, for the case of expander graphs with constant maximum degree, Theorem 11.1 
implies that the maximum number of probed bins per ball is O(loglogn). 

An important feature of our local search allocation is that it maintains a smooth load 
distribution (cf. Lemma l2.2p ; i.e., the load difference for each edge of G is at most one and balls 
are only placed in bins whose load is a local minimum. Hence, if each ball is controlled by an 
agent who strives for a minimization of their load, then the agents have a natural incentive to 
follow the local search allocation. 

It is also important to remark that our process uses only a small amount of randomness. 
For instance, when ties are broken by means of a deterministic collection of {fulueVj the only 
randomness comes from the birthplaces of the balls, which requires only nlog 2 n random bits. 



From this perspective, it is comparable to the process by Mitzenmacher et al. 15], which use 
the same number of random bits and achieves a maximum load of O (log logn) as well. 

Further Related Work. The work that is most related to ours is that of Kenthapadi 
and Panigrahy [l^]. They analyzed a balls-into-bins model where each ball chooses a pair of 
adjacent bins uniformly at random and is placed in the bin with the smaller load. They proved 
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that, for any d-regular graph, the maximum load after n balls are allocated is log logn + 0(1 + 

log n 



. , a— r ), and also showed an almost matching lower bound. Hence, to retain the maximum 

log(rf/ log n)> ° 

load of O (log logn) from the 2-choice process, the degree d must be as large as Q(nP( 1 <V og ^° gn ''). 
In contrast, our local search allocation achieves a maximum load of O(loglogn) even for a 
large class of constant-degree graphs. The model from Kenthapadi and Panigrahy [l2] was 
also studied implicitly in Peres et al. where the authors analyzed the gap defined as the 
difference between the maximum and average load. They proved that, even if the number of 
balls m is much larger than the number of bins n, the gap is G(logn) for expander graphs and, 
for the cycle, the gap is between fi(logn) and O(nlogn), i.e., independent of the number of 
balls. 

In a different context, Adler et al. [l| studied a related graph-based coupon collector process. 
In this process, each ball performs a local search but distinguishes only between empty and 
nonempty bins. Therefore, each ball is allocated either to its birthplace or to one of its neighbors. 
The authors analyzed the number of balls required until all bins are non-empty. 

There are several other variations of balls-into-bins models for which the power of two choices 
has been analyzed (we refer to 1J] for a survey). For instance, Broder and Mitzenmacher 
considered a multidimensional version where balls correspond to 0/1 vectors. Godfrey 



analyzed balanced allocations on hypergraphs, extending the model of |12|] for allocations on 
graphs. Among other results, the author obtained a constant maximum load for an almost 
regular hypergraph if each hyperedge consists of d = ©(logn) bins, which is comparable to our 
result for dense graphs (Theorem II. 4|) . 

Local search is a generic method to solve optimization problems and several recent studies 
in algorithmic game theory involve local search-based processes. In contrast to our model, these 
processes usually start from a state where all tasks are assigned and allow, either sequentially 
or in parallel, tasks to be reallocated by using so-called improvement steps (a.k.a. selfish steps). 
For various settings, lower and upper bounds on the number of improvement steps until a Nash 
equilibrium is found have been shown 0, [l(| ■ We note that the total number of improvement 
steps in our process is always bounded by 0(n) with probability 1 — o(l), regardless of the 
underlying graph. 



2 Basic properties 

We start with some notation. Let G = (V, E) be an undirected, connected graph, where the n 
vertices represent n bins to which n balls should be allocated. For each node v & V, denote by 
xlp the load of v right after the ith ball is allocated. Thus, we initially have Xv = for all 

(n) 

v€V. Let XMk be the maximum load after n balls have been allocated; i.e., 



lW=maxlW. 

Let Ui £ V be the birthplace of ball i, so Ui is a uniformly random sample from V. Recall that 

— (n) 

in the 1-choice process, for all i ^ 1, ball i is allocated to Ui. For any v E V, let X v be the 
load of v after n balls are allocated according to the 1-choice process. In symbols, we have 



X<? ) = \{ie[l,n]:U i = v}\. 
With this, define the maximum load for the 1-choice process as 

-v( n ) _ ^v( n ) 

A max •— m ^-^v ■ 

Now, for two vectors A = (ai, 02, • • • , a n ) and A' = (a^, a' 2 , ■ ■ ■ , a' n ) such that Ya=i a i = 
Ya=i a 'i' we sa y A majorizes A' if, for each n = 1, 2, . . . , n, the sum of the k largest entries 
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of A is at least the sum of the k largest entries of A'. More formally, if ji,j2, ■ ■ ■ ,jn are distinct 
numbers such that aj 1 ^ aj 2 ^ • • • ^ a Jn and j[, j'2, ■ ■ ■ ■> j' n are distinct numbers such that 
a'/ ^ ol'ai ^ • • • ^ a'/ , then 

Jl J2 

A majorizes A' if aj i ^ a^/ for all k = 1, 2, . . . , n. 

i=i i=i 

The lemma below establishes that the load vector obtained by the 1-choice process majorizes 
the load vector obtained by the local search allocation. As a consequence, the maximum load 
of our local search allocation process is 0(logn/(loglogn)) for any graph. 

Lemma 2.1 (Comparison with 1-choice). For any fixed k ^ 0, we can couple and so 
that, with probability 1, X majorizes X 1 - ' . Consequently, we have that, for all k ^ 0, X m , AX 
stochastically dominates X^lx- 

Proof. The proof is by induction on k. Clearly, for k = 0, we have Xy°^ = X^ = for all v G V. 
Now, assume that we can couple X^ 1 ^ with X^ k ^ so that X^ k ^ majorizes X^^ . Now let 
ji, 32, ■ ■ ■ ,jn be distinct elements of V so that ^ Xj k ~^ ^ ■ ■ ■ ^ x\ k 1 \ Similarly, let 

j[,j 2 , ■ ■ ■ , j' n be distinct elements of V so that Xy ^ X^/ ^ ■ ■ ■ ^ -Xj-; . Now let £ be a 

uniformly random integer from 1 to n. Then, for the process {X^) ve v, we let the /cth ball be 
born at vertex jg and define 1 such that j L is the vertex to which the A;th ball is allocated. Note 
that, 1 ^ i. For the process [X v )„ 6 y, we set the birthplace of the /sth ball to j^. Therefore, 
for any k = 1, 2, . . . , n, we have 

£4? = £44' + 1 ( « > 5. E 4'-" + 1 ( « > > £ 4"* + 1 (- > = E 4'. 

i=l i=l j=l i=l j=l 



where the first inequality follows by the induction hypothesis and the second inequality holds 
since 1 ^ I. □ 

For any v G V, let TV,, be the set of neighbors of v in G. The next lemma establishes that 
the local search allocation always maintains a smoothed load vector in the sense that the load 
of any two adjacent vertices differs by at most 1. 

Lemma 2.2 (Smoothness). For any k ^ 0, any v G V and any u G N v , we have that \X^ — 
X (k) \ ^ 1. 

(k) (k) 

Proof. In order to obtain a contradiction, suppose that A4 ^ X„ + 2, and let j be the last 
ball allocated to v. Then, we have that 

= X™ + x^ + 1. 

Therefore, the moment the jth ball is born, vertex v has at least one neighbor with load strictly 
smaller than v. Therefore, ball j is not allocated to v, establishing a contradiction. □ 

For any vertex v G V and integer r ^ 0, let N£ be the set of vertices of G whose distance to 
v is exactly r (in particular, N® = {v}), and let B r v be the set of vertices of G whose distance 
to v is at most r; then B r v = U[=o ^v- Below we show that, if for a given v G V we have an 
upper bound for the number of balls allocated to _B£, then we obtain an upper bound for the 
load of v. 
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Lemma 2.3 (Upper Bound). Let v be an arbitrary vertex of G. Suppose that there exists an 
integer r ^ 1 and a positive \P, that may depend on n, such that the total number of balls 
allocated to the vertices of B T V is at most ^\B^\; i.e., J2ueB r ^ Then, we have that 

r I I 
i=0 1 1,1 

Proof. Note that, by Lemma 12.21 for any u G JVj, we have ^ — *■ Using this and the 
condition of the lemma, we obtain 

r r 

> ^ > ^(iw - i)\K\ = x^\b:\ - Y,i\K\- 

ueB r u i=o i=o 

□ 

Complementing the previous lemma, we now prove a lower bound on the maximum load 
that depends only on the number of balls born at a subset of vertices and the cardinality of a 
small ball around that subset. 

Lemma 2.4 (Lower Bound). For any subset Scy, let := ^r=i ^ ^ e ^ e num ber 
of balls born in S. Then, the maximum load (3 := X^lx satisfies the following inequality: 

where Bg := [j seS Bs is the set of vertices with distance at most j3 from S. 

Proof. If the maximum load is /3, then every ball born at some vertex u is allocated in Bu, and 
clearly the load of any vertex in Bg is at most /?. Combining these two insights yields 

and therefore the maximum load /3 must satisfy /3 ■ \Bg\ ^ $5. □ 

For the next two lemmas, we need ties to be broken either uniformly at random or by means 
of a fixed permutation of the neighbors of v for each v G V. The next lemma establishes 
that the load vector X^ satisfies a Lipschitz condition, which will turn out to be crucial in our 
proofs. 

Lemma 2.5 (Lipschitz property). Let k ^ 1 be fixed and m,U2, ■ ■ • ,itfc G V be arbitrary. Let 
(X^ k ') ve v be the load of the vertices of G after the local search allocation places k balls with birth- 
places ui,U2, ■ ■ ■ , life. Let i G {1, 2, . . . , k} be fixed, and let (Yy k ^) v ^y be the load of the vertices of 
G after the local search allocation places k balls with birthplaces u%, u?, . . . , Uj_i, u[, Wj+i, Ui+2, ■ ■ ■ , Uk, 

(k) (k) 

where u i G V is arbitrary. In other words, Yy is obtained from X„ by changing the birthplace 
of the ith ball from Uj to u\. Assume that, for both processes, the local search allocation breaks 
ties either uniformly at random or via the permutations {£v}vev described in Remark \1.2l Then, 
there exists a coupling such that 

^|X«-F«|<2. (2.1) 
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Proof. We refer to the process defining the variables as the X process, and we refer to the 
process defining the variables as the Y process. If ties are broken uniformly at random, 
then for each v £ V and i ^ 1, we define £{, to be an independent and uniformly random 
permutation of the neighbors of v. We use this permutation for both the X and Y processes to 
break ties when ball i is at vertex v. Then, since the first i — 1 balls have the same birthplaces 
in both processes, we have that 

X^-V = Y^ for all v G V. (2.2) 

Now, when adding the ith ball, we let V{ be the vertex to which this ball is allocated in the X 

process and v[ be the vertex to which this ball is allocated in the Y process. If V{ = vL then 

u\ u\ 

X y u' = Y£> for all u G V and fl2j} holds. More generally, we have that 

X®=Y® + l( Vi *vC), Y®=X® + l(vi?v$ and X® = Y® for u G V \ R 

i i 

(2.3) 

If i = ft, then this implies (|2.ip and the lemma holds. 

For the case i < k, we add ball i + 1 and are going to define Vi + \ and v' i+l so that (|2.3p 
holds with i replaced by i + 1. Then the proof of the lemma is completed by induction. We 
assume that Vi ^ v[, otherwise (|2.ip clearly holds. We note that Vi+\ and v' i+l will not be in 
the same way as Vi and v[. The role of Vi+i and v' i+1 is to be the only vertices whose loads in 
the X and Y processes are different. The definition of Vi+\ and v' i+1 will vary depending on the 
situation. For this, let ball i + 1 be born at Uj+i and define w to be the vertex on which ball 
i + 1 is allocated in the X process and w' to be the vertex on which ball i + 1 is allocated in 
the Y process. We can assume that w ^ w' , otherwise (j2.3|) holds with i replaced by i + 1 by 
setting Vi+i = Vi and v' i+l = v\. 

Now we analyze ball i + It is crucial to note that, during the local search of ball i + if it 
does not enter Vi in the Y process and does not enter v[ in the X process, then ball i + 1 follows 
the same path in both processes. Since we are in the case w ^ w', we can assume without loss of 
generality that ball i + 1 eventually visits Vi in the Y process. In this case, since the local search 
performed by ball i in the X process stops at vertex Vi, we have that Vi is a local minimum for 
ball i + 1 in process Y, which implies that w' = v%. (The case when ball i + 1 visits v[ in the X 
process follows by a symmetric argument.) So, since w ^ w' ', we have X ( l +1) = Y^ +1) . Then 
we let Vi + \ = w. If w = v' i} we set v' i+1 = w and (|2.3p holds since = Yu +1 ^ for all u G V. 

Otherwise we set v' i+1 = v[, and (|2.3|) holds as well. □ 



The following is a consequence of Lemma 12.51 

Lemma 2.6 (Monotonicity). Let k 1 be fixed and ux, U2, ■ ■ ■ , Uf- £ V be arbitrary. Let 
(Xv) ve y be the load of the vertices after k balls are allocated with birthplaces u\, 112, ■ ■ ■ , u^. 
Let i G {1, 2, . . . , k} be fixed, and let (z£ ) ve y be the load of the vertices ofG after ft— 1 balls are 

ii k) 

allocated with birthplaces u\,U2, ■ ■ ■ , iH—i, Ui+i, nj+2, • • • ,Uk- I n other words, Z-b is obtained 

(k) 

from X{, by removing ball i. Assume that, for both processes, the local search allocation breaks 
ties either uniformly at random or via the variables (^ v )veV described in Remark \1.2l Then, 
there exists a coupling such that 

V \XW - 7^ k) \ = 1 
v&V 

Proof. Let G' be the graph obtained from G by adding an isolated node w; i.e., G' has vertex 
set V U {w} and the same edge set as G. Applying Lemma I2.5I to G' with the same choice of 
u%, . . . , Ufr G V and with u' i = w gives 

V \x^ -y( fc ) 
/ j y v v 

vGVU{w} 
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Since Y, 



(k) 



1, X« 



and Z, 



(i,k) 



(k) 

for any v E V, we conclude that 



J2\xw-zw\ = 52\x 



(k) 
v 



□ 

We now use Lemma 12.61 to prove a type of subadditivity property. In a simpler statement, 
we show that, if for k independent of copies of the local search allocation with m balls the 
maximum load is at most x, then the maximum load obtained after placing km balls via local 
search allocation is at most kx. 

Lemma 2.7 (Subadditivity). For any 1 ^ z ^ n, and any x ^ 0, it holds that 



Pr 



X^L > \n/z] ■ x ^ \n/z] • Pr \ ^ 



(n) 

Proof. Let U\, U%, ■ ■ ■ , U n be independent uniform random samples from V. Then, Xmax is 
the maximum load after n balls are added to G with birthplaces U\ , U% , . . . , U n . We define 
k := \n/z] independent copies of the local search allocation, where in the first copy we allocate z 
balls according to the birthplaces U\, U2, ■ ■ • , U z , in the second copy we allocate z balls according 
to the birthplaces U z+ \,U z+ 2, ■ ■ ■ ,U2 Z and so on and so forth. Hence, in total we allocate 
\n/z~\ ■ z ^ n balls. Let Mi, M2, • • • , be the maximum load of each copy, respectively, after 
z balls are allocated. Then, we claim that XmsL ^ ^mm ^ /Li=i ^i- This follows since, for 
each copy i, after allocating the z balls, we can successively add more balls in such a way that 
all vertices have load exactly Mj in copy i. Then, by taking the union of all copies, we obtain a 
balls-into-bins process with Y2i=i Mi balls, n of which have birthplaces Ui, XJ2, ■ ■ ■ , U n . Then, by 
Lemma |2.6( the maximum load in this process, which is Yli=i Mi, is at least Xmax- Therefore, 



Pr 



X£l>\n/z].x 



< Pr 

< Pr 



V fc M t > \n/z] ■ x 
* — '1=1 



CkVr 



X^ > x 

"■max ^ 



□ 



3 Expander graphs 

In this section we give the proof of our main result, Theorem 11.11 which establishes an upper 
bound for the maximum load after n balls are allocated to the vertices of an expander graph. 
In fact, we can prove this theorem in a more general setting. As before, for each u G V and 
r = 1, 2, . . ., we define N£ to be the set of vertices of V whose distance to u is exactly r, and 
B r u to be the set of vertices of V whose distance to u is at most r; in symbols, 

r 

N£ = {v £ V : graph distance between u and v is r} and B r u = TV*. 

i=0 

We say that G has exponential growth if there exists a constant <ft > so that 

|-B^| ^ min |exp(0r), — | for all u € V and r ^ 0. (3-1) 

Note that any graph with exponential growth has a diameter of O(logn). Moreover, if G is an 
expander, then it has exponential growth. Therefore, Theorem 11.11 follows from the theorem 
below. 
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Theorem 3.1. If G has the exponential growth property defined in (|3.1|) and bounded degrees, 
then there exists a positive constant C so that, as n — )■ oo, 



Pr XW > C^g log n 



We devote the remainder of this section to prove Theorem 13. 11 We start with a high 
level description of the proof. We claim that, for any vertex v and some properly chosen 
r = O (log log n), 



i.e., with very high probability, the number of balls allocated to the vertices of B^° is at most 
C\B^°\. Having established (|3.2|) . the proof follows immediately by applying Lemma 12.31 and 
taking the union bound over all v. Now, in order to prove (|3.2p . we use that Lemma 12.51 
establishes that the load of the vertices satisfies a Lipschitz condition; i.e., if the birthplace 
of one ball is changed, the load vector can only change in two vertices. Therefore, we can 
apply Azuma's inequality to bound the probability that Yl U £B r ° ^ C|_B^° | ; however, for 
ro = O (log log n), the probability bound obtained via Azuma's inequality is not small enough to 
take the union bound over v £ V later. Nevertheless, Azuma's inequality gives a small enough 
bound when applied to any radii larger than some R ^> ro- Then, the idea is to control the 
number of balls allocated to B^" 1 by using the bounds obtained for all radii r ^ R, and then 
apply an inductive argument to finally establish (|3.2[) , 

The main intuition why this analysis works is because a ball can only be allocated inside 
B r v if the ball is either born inside B r v or it is born in a vertex u at distance j to B T V but whose 
load, at the moment the ball is born, is at least j. This is true because, at each time a ball 
moves from a vertex u to a vertex v! G N u during the local search, the load of u must be 
strictly larger than the load of u' . In other words, the ball traverses a load decreasing path 
from its birthplace to the vertex on which the ball is placed. Therefore, for a ball allocated in 
B r v , the larger the distance between the birthplace of this ball and B r v , the smaller the number 
of possibilities for the birthplace of the ball since these vertices must have a large load at the 
moment the ball is born. This, in a high-level description, gives that if we change the birthplace 
of a ball to a uniformly random vertex, the load of the vertices inside B r v does not change with 
high probability. This allows us to control the variance of the Lipschitz condition and apply a 
more refined version of Azuma's inequality to move from radius r to radius r — 1 inductively 
until we reach radius ro- We remark that we actually need to control not only the number of 
balls allocated to B r v for all r G [ro,R], but also the number of balls allocated to nodes in B r v 
whose load is at least t for all r G [?"o,-R] and many values of i. We defer the details for the 
rigorous argument below. 

Now we proceed to the rigorous argument. We start by showing that the load at any given 
vertex has an exponential tail. 

Lemma 3.2. Let v be any given vertex ofV and let A be the maximum degree of G. Then, for 
any z ^ 8eA, 



Proof. We start defining a sequence of vertices wq,w\, . . . and time steps to > t\ > ■ ■ ■ such 
that, for every j ^ 1, ball tj is born at vertex Wj and allocated to Wj-i. We start by setting 
wo = v and to = n. Inductively for j ^ 1, we let tj be the last ball allocated to Wj—i before 
time tj-i and set Wj to be the vertex at which ball tj is born. So, for j = 1, t\ is the last 

(n) 

ball allocated to u^o and w\ is the vertex where ball t\ is born. Note that, whenever X{, ^ z, 
if w\ = wo, we know that xjjfj ^ z — 1. On the other hand, if w\ / wq, then we have that 



Pr 




(3.2) 
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Xwl ^ z + d(wo,wi) — 1, where d(u,v) is the graph distance between u and i>. More general, 
for all £ ^ 1, we have that 

4^^+E^- 1 ^)- 1 )- 

We continue this procedure until we find a value of £ such that X^f = 0. Note that, for each 
j, we have X^p — X&Zl ^ — 1; consequently, we can have X^P = only for I ^ z. In 
order to obtain an upper bound for Pr X„ ^ z , we apply the first-moment method over 
all possible sequences (w\,W2, • • • ,wg) £ V and t\ > t<2 > ■■■ > tg, for every £ ^ 1, such that 
X^iC^O^j-is ^j) — 1) ^ — z - With this, we have 



Pr [iW ^ 

\ r , 

^ yjC^C^J— 1' w i) — 1) ^ — z I f r P) . {ball tj is born at Wj} 

e^z ™i.»2.-^ \j=i I 

^E E ifE^K-i'^)- 1 )^- 2 )^- ( 3 - 3 ) 

^2 «"l»«2>-'°* \i=l / 

Let Aj = Wj)— 1 G {—1, 0, 1, . . .}. For any fixed £, we can estimate the number of possible 

sequences by counting the number of possibilities to choose \j and tj so that Ylj=x <\j ^ ~~ z -> 
and then counting the number of possibilities to choose the Wj accordingly. Clearly, there are 
at most (") ways to choose the tj. Let k z be the number of values of j for which Xj = — 1. 
Then, the other £ — k values of Xj are all non-negative and must sum to at most k — z. With 
this, we can bound above the number of choices for the tj and Xj by 

k — z k — z 

Once the Xj are fixed, the number of choices for the Wj is at most 

t 

JjA^ +1 <A <_ *. (3.5) 
i=i 

Plugging the estimates in (|3.4p and (|3.5p into (|3.3fl . we have 

}2£-* A 1-2 



Pr 



v ^ 1 v-/e-n\^ 2 2e - z A e - z v-^ / 4e • A \ v-/4e-AV /4e-AY 



where the last inequality uses the fact that z ^ 8e • A. □ 

Throughout the section, we fix an arbitrary vertex v and bound the number of balls allocated 
to the vertices of B r v for all rg ^ r ^ R, where 



r := 



min|r: \B r v \ ^log 10 n| and i?:=min|r: \B^\ ^ T^}' ( 3 -6) 



We will consider the balls (B^) r ^ ro , and will bound the number of vertices in B r v with load at 
least £ for all integers £ 6 [A)>^i] ; where 

ft = 8.. A' ffid ft-ij^. (3.7) 
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Then, for all r and £, define 

A r , e = {ueB r v :XW >£}. 

In order to control A r ^, we will need to estimate the probability that the nth ball changes the 
load of vertices in B r v . For this last value, we need to control the load of the vertices after n — 1 
balls have arrived. Then, we define 

Note that A*^ C A r ^. By Lemma 13.21 we have that, for £ ^ £q and any r ^ 1, 

£+k / -, \ i+k 



E| 



Next define 



/4p • A\ + / 1 \ + 

U fc>0 K +k/+ k\ } < £ |^ r+fc |2 — - < \B r v \ ^ A fc 2 — < 4(2A)-^|- 

(3.8) 



hence, A£ = IJ^Li A* t an d define the event 

log n 



n{|AH^.(2A)- + ^}. (3.9) 



From now on let J-i be the c-algebra induced by the configuration obtained after i balls are 

placed. More formally, if Ui, U2, ■ ■ ■ , Ui are the birthplaces of the first i balls and, for each v £ V 

(i) 

and j = 1,2, ... ,i, we define fir to be an independent uniformly random permutation of the 
neighbors of v, where {£v^}vev are the permutations used to break ties uniformly at random 
for ball j, then T{ is the cr-algebra induced by U\, . . . ,Ui and {£^}ueV,i^;Ki- 

Lemma 3.3. Let v £ V be fixed, and let R and L* R be as defined in (|3.6p and (|3.9p . respectively. 
Then, there exist uq so that, for all n ^ n$, we have 

Pr[L R ] ^ l-2n- log5n . 

r n \ 

Proof. Recall that by Lemma 12.11 there is a coupling such that with probability 1, X ma- 
jorizes X^ n \ Hence the claim follows directly by Lemma lA . 41 and a union bound over all £ with 
£ ^£^h. □ 

For any r with 1 ^ r < R and any £ ^ £q, in order to bound the number of vertices in A r ^, 
we will look at the probability that the n-th ball affects A r ^. In other words, we control the 
probability that A r ^ is different from A*^. Note that it is only possible that A r £ 7^ A*, if the 
n-th ball is born at a vertex of A*^_ 1 or if it is born at a vertex of N 7 v +k with load at least 
£ — 1 + k for some k ^ 1; we shall bound this last set of vertices by A r+ k,e~i+k- We define 
inductively for r < R 

and, for £ ^ £q, 

L* 4 = L*,_ x n {|A^| < 8|K|(2A)^ + (3.10) 

The next lemma establishes that, with high probability, the last ball cannot affect the load 
of a small set of vertices. 

Lemma 3.4. Let v be any fixed vertex, r ^ 1 and £ ^ £q. Then, there exists a positive constant 
c = c(A) such that 



Pr 



U U6A * {^v^) 



c|f?£|(2A) 3 log n log log n 

n n 
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Proof. Note that the n-th ball can only change the load of a vertex in A*^_ x if it is born at a 

vertex of B T V of load at least £ — 1 or if it is born at a vertex u £ N^ +k with X^ 1 ^ ^ £ — 1 + 
for some k ^ 1. Fixing any realization for the birthplaces of the first n — 1 balls, and thereby 
fixing the sets A*^ for all r and £, we have the following upper bound 



Pr 



at 'V,(-i 



I A* I Ml 00 A* I 
< \ lv r,e-i\ _|_ I Ufc=l 1 \+k,l-l+k\ 



n 



n 



where the probability above is taken over the choice of U n only. Note that L*^_ 1 is measurable 
with respect to J- n -\ since the birthplace of the n-th ball is independent of any event in J- n -i- 
Then, for i > £q, we obtain 



rl-l 



8|^|(2A)- m + if^ ^ 8 \B r v +k \(2A)- e+1 - k + 8\B*\(2A)- £ - R+r + 



n 



+ E 



fc=i 



where the last term comes from L\. Using the bounds \B r v +k \ \B" v \A k and |.B*| ^ |.B£|A i *- r , 
we obtain 



Pr 



r,4 



8|^|(2A)" 



e+l _|_ log 7 w fl-r+1 



J? 



-+ E 



B£|A fc (2A)- 



8|BS|(2A)- 



fe=i 

m + ^f | 16|f%|(2A) 



n 



— + E 



log 7 n 



+ 



k=0 

2 log 7 n log log n 



(£ - 1 + fc)n 



n 



24|5£|(2A) 



n 

^ 3 log 7 n log log n 



n 



n n 
For ^ = £q, we simply bound |A*^ _ 1 | by \B^\, which gives that 



Pr 



\Bl 



n 



R-r+1 



+ E 



|S;|A fc (2A)-^ +1 - fe lo ; 



fc=l 



+ E 

fc=0 



n 



(£ -l + k)n 



< 16| J B;|(2A)-^ +1 + 2 log 7 n log logn 



n 



Now, for any i ^ 1, r S [ro, -R] and ^ G [^o>^l]> we define T l r £ CV l as the set 



□ 



= <j ( Ul ,u 2 , ....u,)< I'': Pr [L* £ | f^lj^ = M 



^ 1 ? for all j = 1,2, . . . ,i> . 

n z J 

(3.11) 

Intuitively, for any given r, £ and i, the set T l ri contains the good birthplace for the first i balls 
so that the event L*^ is likely to occur, conditioning on any prefix of the birthplaces. 

While Lemma 13 41 considered the effect of the last ball n, the following lemma studies the 
effect of replacing the birthplace of ball i by a randomly chosen bin. 
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Lemma 3.5. Let i, i, r and v be fixed. Let f : V n — )• Z be an increasing function that depends 
only on (X^) v / e ^ ri and is 1-Lipschitz. Let Wo, W\, . . . , W n —i and Wq be i.i.d. random variables 
chosen uniformly from V . Then, if (tti, U2, ■ ■ ■ , ^i-i) £ ^tf, we have 



Pr 



f(ux, Ui-t, W , Wi, . . . , W n -i) ^ f(m, Ui-i,W , Wx, . . . , W n -i) 

~2 



6c|ff£|(2A)~ m 18 log' re log log n _5_ 
n n n 



where c is the constant from Lemma \3.4 



Proof. In this proof all the probabilities are taken conditional on Uj = Uj for all j = 1, 2, 



but we will omit this dependence from the notation. The idea is to relate the probability above 
to 



Pr 



/(til, . . . , Uj_i, Wi, W n -i, W ) + /(til, • • • , Ui-i,Wi, Wn-i, Wo) 



which corresponds to changing the n-th ball instead of the t-th ball; this will allow us to apply 
Lemma 13.41 

Consider the three events below: 



Ei = 


{/(«!,■■ 


. ,Ui-i,W ,Wi,. . . 


W n -i)^f( Ul ,.. 


,Ui-i,Wi,.. 




-i,W )} 


Ei = 


{/(«!,.■ 


. ,Ui-i,W ,Wi,. . . 


W n -i)^ /(«!,.. 


,Ui-i,Wi,.. 


■ ,w n . 


-i,W )} 


Eend = 


{/(«!,.- 


. ,Ui-i,Wi,. . . ,W n 


-i,W )^ /(«!,.. 


,Ui-i, Wi, . . 


-,W n - 


-i,W )}. 


Clearly, /(tti, 


. . . , Ui-l 


W ,Wi,...,W n -i) 


and f(ui, ... ,Ui- 


-i,W ,Wi,.. 


.,w n . 


-i) can o 



different if at least one of E\, Ei or -E cn d happen. Therefore, we can write 



Pr 



/(til, . . . , Ui-i,W , Wi, . . . , W n -i) + /(til, . . . , Ui-i, Wq, Wi,..., Wn-i) 



^ Pr [ Ei ] + Pr 



Ei 



+ Pr[E eDd ]. 



(3.12) 



We start with the term Pr [E eri( i ]. Let I e nd be the event that L* t _ x happens given that the 
birthplaces of the first n — 1 balls are according to the sequence (m, . . . , Ui-i, Wi, . . . , W n -i)- 

(n) 

If Yu is the load of vertex u when n balls are added with birthplaces tti, ti2, . . ., tij_i, Wi, 

^ (n) 

W2, ■ ■ ■, W n -i, Wo and Yu is the load of vertex tt when n balls are added with birthplaces ui, 
ti 2 , • • ., Ui-i, Wi, W 2 , ■ ■ ., W n -i, Wo, we have that 



Pr[£ end KPr[I e c nd ]+Pr 



+ Pr 



U + y W} n {yW > £}) n / ( 



ieBT, 



'end 



|J ({Y^ ± fW} n {yW > £}) n / end 



(3.13) 



where Pr [/ e c nd ] ^ re~ 2 by (pmi) since (m, . . . , u;_i) G T^ 1 and Wi, . . . , W n _ ,• are i.i.d. uniform 
samples from V . The other two terms in equation (|3.13p can be bounded by Lemma 13.41 

Now it remains to bound Pr [ Ei ] , since by symmetry we have Pr [Ei] = Pr Ei . In order 

to bound Pr [Ei ], we consider all cyclic permutations of (tti, . . . , Ui-i, Wo, W%, . . . , W n -i)- More 
specifically, we first compare 

/(tti, • • • , Ui-i, W , Wi,..., W n -i) with /(tti, • • • , Ui- 1: Wi, W 2 ,..., W n -i, Wo), 

then we compare 

/(tti, . . . , m-i, Wi,W 2 ,..., W n - U Wo) with /(tti, • • • , Ui-i,W 2 , W 3 , . . . , W n -i, Wo, Wi), 
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and so on and so forth until we compare 



/Oi, . . .,Ui_i,W n -i, W , Wi, . . ., Wn-i-i) with /(ui, . . . ,Ui-!, W ,Wi,. . . ,W n -, , 



In order to do this, we define a graph H whose vertex set is {0, 1, . . . , n — i} x v n ~ l+1 ; so the 
vertices of H have the form (j, zq, Z\, . . . , z n -i). We let each vertex of H have exactly one outgo- 
ing edge and one incoming edge by having a directed edge from each vertex (j, zq, z\, . . . , z n ^i) 
to (j' , zi, Z2, ■ ■ ■ , z n -i, zq) where j' = j + 1 (mod n — i + 1). With this, the sequence 

(0, Zq, Z\, . . . , Z n —i), (1, Z\, . . . , Z n —i, Zq), . . . , (J, Zj , . . . , Z n —i, Zq, Z\, . . . , Zj— i), . . . , 

(/l i, Z n —i, Zq, • • • , Z n —i—\), (0, Zq, Z\, . . . , Z n —i) 

forms a directed cycle in H. Since each vertex has in-degree and out-degree equal to one, we 
have that each connected component of H is a directed cycle with n — i + 1 vertices. Now, 
for any vertex (j, zq,z\,. . . , z ri _i) of H, we say that the edge from (j, zq,Z\, . . . , z n -i) to (j + 
1, z\, Z2, ■ ■ ■ , z n -i, zq) is f -increasing if 

f(ui,U 2 , ■ ■ ■ ,Ui-\,ZQ,Z\, . . . ,Z n -i) < f(ui,U 2 , ■ ■ ■ ,Ui-l,Zl,Z 2 , ■ ■ ■ ,Z n -i,Zo). 

Similarly we say that the edge is f -decreasing if the opposite inequality holds: 

f(ui,U 2 , ■ ■ .,Ui-\,ZQ,Z\, . . . ,Z n -i) > f{ui,U 2 , . . . ,Ui-l,Zi,Z 2 , ■ ■ ■ ,Z n -i,Z ). 

Since / is 1-Lipschitz and integer- valued, for any directed cycle, the number of /-increasing 
edges is the same as the number of /-decreasing edges, and therefore the number of /-increasing 
edges in H is the same as the number of /-decreasing edges. 

Note that choosing a sequence (Wq, Wi, . . . , W n -i) uniformly at random from Y n ~ %Jrl and 
a number k uniformly at random from {0,l,...,n — i} gives a uniformly random vertex of H. 
Similarly, if we choose a vertex (j, zq,z\,... , z n -i) uniformly at random from the vertex set of 
H, then (zo, z\, . . . , z n _i) is a uniform sample from V n ~ l+ . Since each vertex of H has a unique 
outgoing edge, one can choose a uniformly random edge of H by choosing the outgoing edge of 
an uniformly random vertex of H. 

We fix a uniformly random vertex (j, zq,Z\,... , z n -i) of H and let e be its unique outgoing 
edge; i.e., e is the edge from (j, zq, z\, . . . , z ri _i) to (J + 1, z%, . . . , z n -i, zq). Therefore, 

Pr [Ex] ^ Pr [e is /-increasing] + Pr [e is /-decreasing] = 2Pr [e is /-increasing] . (3.14) 

(k) 

Let Yu be the load of vertex u after k balls are added with birthplaces given by the sequence 
(u\, . . . ,u,- t -\, zq, . . . , z n -i) and let Yu be the load of vertex u after k balls are added with 
birthplaces given by the sequence {u\, . . . , Z\, . . . , z n -{, zq). 

Note that, by Lemma 12.61 removing a ball cannot increase the load of any vertex; this gives 
that 

^ Y^ for all u G V. 

Therefore, e can only be /-increasing if ?i n) ^ yj n 1] and yj n) ^ i for some u £ B r v . Let I e 
be the indicator that L* i _ 1 holds given that the birthplaces of the balls are according to the 
sequence (ui, . . . , Uj-i, Z\, . . . , z n -i, zq). Hence, using (|3.14p . we have 



Pr [Si] ^ 2Pr 



[J ueBr ({FtM * n (n) > n {yj n) > i}) I i, 



+ 2Pr[i; 



By equation (|3.11|) . Pr [ Jg] ^ n 2 and the other term can be bounded by Lemma 13.41 Then, 
we put this and (|3.13|) into (|3.12[) to complete the proof. □ 
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Now we will use an inductive argument to bound the probability that L r (_ happens for all r 
and I. 

Lemma 3.6. There exists a positive constant c' such that, for any v £ V , r < R and any 
integer I G we have 



Pr 



\A r/ \ > 8\B r v \(2Ay e + 



log n 



< exp(- log 4 n) + n 2 • Pr [ (L*^_ t ) 



for all large enough n. 

Proof. Recall that, by Lemma 13.21 we have 



E[|A r/ |] ^2\B r v \ 



4e • A 



< mb:\ (2A)- 



Define the stopping time 



r = n A min{j : (U u U 2 , . . . , Uj) ? 3 rl }, 



where for any two numbers a A b := min{a, b}. Let Xi = E [|A r ^| | Ti\ and Zi = -Xj Ar . Note 
that Zi is a martingale since r is a stopping time. Moreover, r is a bounded stopping time and, 
using the optional stopping theorem, we have that E[Z T ] = E[Zo] = E[Xo]. We want to 
bound the conditional variance of Zi — Zi—\ uniformly over all i, which is given by 

Var^ [ Z, - Zi-t I ]=B Ut [ (Zi - Zi_!) 2 | Ji_ a ] - (Eu. [ Zi - Zi_ x \ \) 2 

= E^ [ (Zi — Zi-!) 2 | ] , 

where the variance and expectation are taken over the choice of Ui only. Now, we write 

E?7j [ (Zi — Zi-!) 2 | Fi-i ] 

= E v . [ (Zi - Z,-!) 2 ! (t > i) | J-i-i ] + E Vi [ (Zi - Z,-!) 2 ! (r<i)\ Ti-! ] 
= E Ui [(Zi-Z i -!) 2 l(r^i)\T i -!] , 

since Zi = Zi-! whenever r < i. We want to bound 

.i-l 



E 



Vi [(Z i -Z i -!) 2 \f]. =i {U j = u j } 



uniformly over all (u!,U2, ■ ■ ■ , Uj-i) G ^r/ - ^ n ^ e * ne va l ue °f when Ui = u and let 



E, 



u&V 

Since Xi is 1-Lipschitz we have |C« — Cm'I ^ 1 f° r an "u, - "' £ V. With this, we can write 

i£(<«-c) 2 ^£i<»-<i4£l£^K»-c.') <4££ic,-w. 



Now, using Lemma 13.51 we have that 



n i — ' I 1 — ' n 

uev u'ev 



u&v u'ev 



1 V" i/- a i ^ 6c\B r v \(2A)- e+1 18 log 7 n log log n 5 



n 



uev u'ev 



n 



n 



ii- 



which gives that 



r 2| 1 6c|j?;|(2A)-^+ 1 18 log 7 n log log n 5 



n 
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uniformly over i. Now, note that, for any A > 0, 



Pr[\X n -X \ > A] ^Pr[{|X„-X | > A} n {r^n}] + Pr [r < n] 
= Pr[{|Z n -Z | > A}D{r>n}] + Pr[r<n]. 

Also, we have that for any 1 ^ i ^ n, 
Pr[(^-i) c ] 

= ^ Pr [ (L r/ _ 1 ) C n {(£/!, . . . , U j0 ) ? T*> _J n (nt^^i' • • • > ^) 6 T rVii 
+ Pr [ (L^r n (flU^^ 1 ' • • • ' Uk) £ T -Vi}) ] 

This gives that 

Pr[r<n] ^Pr [([/!,...,[/„_!) 0T^] ^ n 2 • Pr [ {L* r/ _ x ) c } . 

Then, applying the version of Azuma's inequality from Lemma I A. 21 we have 

log 7 n 



Pr 



| Aw| >8|Sj|(2A)-« + 



^ Pr 



^ exp 



\Z n -Z \ >4|^|(2A)- f + 



log n 



(4| J B;|(2A)- £ + ^) 2 



14c|SJ|(2A)-^ + 371og 7 nlog logn 



+ n 2 -Pr[(L*,_ 1 ) c ] 
" + n 2 -Pr[(L*,_ 1 ) c 



If |BJ|(2A) e ^ log 7 n log logn, the exponential term above is at most 



exp 



(4|b;|(2A)- 



(14c + 37)|^|(2A)~V " ~ r \ 14c + 37 
otherwise we bound above the exponential term by 



< ( . xp | - 16| 5J (2 1 ) " ) < exp (-log 4 n) ; 



exp 



(14c + 37) log 7 n log log ny \ (14c + 37)£ 2 loglogn 

where the last inequality holds for all large enough n since t ^ t\ = 0(logn 



log n 



^ exp (— log 4 n) , 



□ 



Proof of Theorem 13.11 First note that the lower bound on the maximum load is established 
by Theorem 11,61 So we now prove the upper bound. Let v be an arbitrary vertex of V. We 
start the proof by showing that there exist positive constants C and c such that, for all large 
enough n, it holds that 



Pr 



*< n) > Wl 

ueBl 



^ exp (—clog 3 n) 



(3.15) 
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Then, it follows by Lemma 12.31 that 



i=0 



Note that YH=o \b"°\ is the average distance between v and a vertex in B^°, which is at most 



7*0. Therefore, we obtain ?J C + Tq. By the definition of ro, we have |-B£°| Alog iU n 

Combing this with the exponential growth property of G yields 

r < \\og\B r v °\ = 0(loglogn), 



where <f> is the parameter defined by equation 
It remains to establish (|3.15p . First note that 



E = 53 1^1 < i w + E i^wi 



Therefore, we have that 
Pr 



E x « n) > c \ B v° 

ueB r v ° 



< Pr 



Now, if |A ro/ | < 8|S^|(2A)- < + ^ for all £ = £ ,£ + 1, . . . ,4, then £~ £q \A n J < (C - 
^o)|-BJ°| for some large enough C since \B^°\ ^ log 10 n. For any given £, using Lemma 13.61 we 
have 

l ° R '" 1 <exp(-log 4 n)+n 2 .Pr[(^ ^ 1 ) c ]. (3.16) 



Pr 



|A nj ,|>8|^»|(2A)-' + 



By definition of L* ri (cf. (|3.10p ). we have that 



Pr[(L*^_!) c ] ^Pr 



< Pr 



|A^„ 1 |>8|^|(2A)-^ + ^ 
|A ro ,_ 1 |>8|^|(2A)-^ + ] ^ 



+ Pr[(L^_ 2 ) 
+ Pr[(L^,_ 2 ) 



< exp(- log 4 n) + (n 2 + l)Pr [ (L* q/ _ 2 ) 



where the second inequality follows since A* Q C A r0j £_i and last inequality follows from 
Lemma 13.61 Applying this into (|3.16|) , we have 



Pr 



|A ro ,,|>8|iC|(2Ar £ + 



log 7 n 



< exp(- log 4 n) + (n 2 + 1) exp(- log 4 n) + (n 2 + l) 2 Pr [ (£* o/ _ 2 ) 
e-io 

^E^ 2 + 1 ) fe ex p(- lo s 4 n ) + ( n2 + l)'^ 0+1 Pr [ (L* ro+lA ) c ] . 



k=0 



Using the same argument, we obtain for any r that 



p r [(L^) c ] < E( n2 + 1 ) fcex p(- 1 °g 4n ) + ( n2 + 1 )' 1 "' 0+lpr [( L W 1 ) c ] 

fc=o 

R-r-lli-to 

< E E (n 2 + l)^ lj+fe exp(-log 4 n) + (n 2 + l/ 1 ^~ r )Pr[(L^) c 

jr=0 fc=0 
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Using Lemma [57 

Pr[(L^) c ] < 2n" log5n , 

and plugging this into (|3.16|) . and using the union bound over £, we obtain that 

log 7 n " 



Pr 



u2jiiwi><wi<5*r'+ !2 v s } 



^ exp(— clog re) 



for some positive constant c, which establishes (|3.15p . 



□ 



4 Grid graphs 

In this section, we analyze the maximum load of the local allocation process on any d-dimensional 

grid, where d is an arbitrary constant. We show that the maximum load is ©( ( ^ ^ — j 
Interestingly, the analysis on the grid turns out to be much easier than the analysis on expander 
graphs, as on grid graphs the number of paths the local search could follow is much smaller. 

Formally, we define the d-dimensional grid by the vertex set V = {u : u = (u\, . . . , Ud), Ui = 
0, . . . , n l / d — 1} and edge set E = {{u, v} : dist(u, v ) = 1}, where 



dist(u,v) = dist(uj, Vj), and dist( 



u i: Vi) = mm< \m - Vi\,n 



l/d 



Ui - Vi 



1=1 



}• 



Proof of Theorem 11.31 We start with the upper bound. Within this proof, we use the follow- 
ing notation: 

B r u := {v G V : dist(uj, Vi) ^ r, Vi = 1, . . . , d}. 

Roughly speaking, B r u can be seen as the "^-version" of the set B r u used in Section [3l Note 
that for any r ^ n 1 ' d /2 — 1/2, \BJ^\ = (2r + l) d . We first define an event that gives an upper 
bound for the number of balls born in BL for various u and r: 



f; =n 



n 



E z * n) < p( r 



where p(r) := 4e • (d + l)d^j)1 
born on v during the first n rounds 



1 • (3r) d and Z^ n ' = Y17=i = v ) ls * ne number of balls 



To prove that £ holds with high probability, fix any vertex u G V and r ^ (Ad) d ( ■ lo - gn hi; 



We have 



* log log n i 



Pr 



E z v n) > 



n 



\Bi\ 



p{r) J \ n 



p{r) 



e • n 



p(r) 



(2r + If 



d \ p(t) 



n 



e • (3r) 



4c -( d + 1 )(l^7) 3TT -(30° 



4e-(d + l)( K fe)^-(3r)<* 



n~ 3 . 



Taking the union bound over the n vertices and at most re/2 possible values for r yields Pr [£] ^ 
1-n -1 . 
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Assuming that £ occurs, we now infer the upper bound on the maximum load. Assume for 
the sake of contradiction that the maximum load is in the interval [a/2, a] where a is any value 



larger than 16e • (16d) d (d + 1)(- 



logn 



d+l 



log log n > " " • u £ V he & vertex with X^ 1 ' £ [a/2, a]. Since 
the maximum load is at most a, only balls that are born in B^ a can reach 5°. Since £ occurs, 
we know for r = 2a that 

l/(d+l) 

• (6a) d . 



^4e-(d+l 

On the other hand, if u has load at least a/2, then 

z (n) > ga/(id) 



log n 



log log n 



(4.1) 



2" d -4d 



a 1 



a 



2d/ 



> 4e • (d + 1) 



4 (16d) d 
log n 



4a 



4a 



(4.2) 



v log log n , 

where the last step used our lower bound on a. The desired contradiction follows now from 
(|4.ip and (|4.2p . and the proof of the upper bound is complete. 

Now we proceed to establish the lower bound. It is a well-known fact (cf. [lil . Lemma 5.12]) 
that with probability at least 1 

are born. Applying Lemma 

(3: 



-l 



log n 



balls 



- n there is a vertex u 6 V on which at least — , 

' log log n 

with S = {u}, = lo^fogn implies that the maximum load 



-Xmax satisfies 



logn 
log log n 



Hence, as \Bu\ ^ \Bu \ ^ (2/3+l) d and d is a constant, we obtain that /3 = Q 



log n 
log log n 



l ■ 
d+l 



□ 



5 Dense graphs 

In this section, we analyze dense graphs which we define as graphs where the minimum degree is 
S7(logra) and the ratio between the maximum and minimum degrees is constant. This includes, 
for instance, the log re-dimensional hypercube and Erdos-Renyi random graphs with average 
degree (1 + e) logn, for any e > 0. The key idea of the analysis is that as long as less than a 
balls are allocated, where a < n, every vertex has a constant fraction of neighbors which have 
received no ball, and hence, the maximum load is bounded by 1. Lemma |2 . 71 implies then that 
after n balls are allocated, the maximum load is at most n/a. To make the analysis work, we 
need to assume that ties are broken uniformly at random; i.e., whenever a ball has more than 
one vertex to be forwarded to, the vertex is chosen independently and uniformly at random 
among the set of possible vertices. 

Proof of Theorem 11.41 We divide the process of allocating the n balls into different phases, 
where each phase allocates a batch of consecutive a ^ n balls (hence the number of phases 
is [n/a]). Then, by subadditivity (cf. Lemma l2.7p . the maximum load at the end is at most 
[n/a] times the maximum load of a single phase. 

Let G be an almost regular graph with minimum degree 5 = c ■ log n and maximum degree 
A ^ C ■ 5, where c > is any value bounded below by a constant and C ^ 1 is a constant. For 
any load assignment of the vertices {pcu)u€i V, we define the exponential potential as: 

<^((x u ) u( z V ) := ^2 ex P I a " Xv ) ' 

uev V veN u I 
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where a := max{41og(ra)/5, 1} = 0(1). We also define for any 1 ^ t ^ n, 



hence = n. Our goal is to bound the expected multiplicative increase in compared 
to $>w. i n order to do that, we will actually also assume that is small. 

Specifically, assume that (x u ) ue v be any vector in (NU{0}) n such that $>((x u )uev) ^ n-e" 5 / 2 
and suppose that the load vector at the end of round t is (x u ) ug y, i.e., X® = x. Then this 
implies for every vertex u £ V, 



exp ( a ■ E < n • e 1 



5/2 _ e logn+5/2 



and consequently, X^ga^ ^ (V* 7 ) ' (l°g n + ^/2) ^ S/A + <5/2 = (3/4)5. Hence, there are 
at least deg(it) — (3/4)5 ^ (1/4)5 neighbors of u which have no ball, where deg(it) is the degree 
of vertex u. In particular, this implies that the next ball t + 1 will be allocated either on its 
birthplace or at a direct neighbor. Therefore, 



Pr 



j2 4 t+i ) = e 4 m) + 1 1 x(t) = ■ 

t)6JV„ veN u 

Pr ball t + 1 born at to and allocated on w | X^ = x 

vGN u weN v U{v} 

1 4\ l\ 5CA 



X/ ( X) ( n 5 ) + n ' 



(5.1) 



This yields, 

E [<J>(* +1 ) I X<*>=x" 

f £ = ^ iW + 1 | X® 



Pr 



x 



veN u 



vGN u 



+ Pr 



E = E ^ i * (t) 



veN u 



veN u 



E k- pi 



E^ t+1) = E^ t) + 1 1 x{t) 



veN u 



veN u 



exp ^•(E^ ) + 1 )) 

V veN u J 

exp (V E^)) 

V veN u / J 

+ l) • exp (a • E 4^ 



(i + e-.^V$(*), 
n / 



(5.2) 



where (|5.2p follows from ()5. 1 j) . Note that if we only consider the allocation of a := n/(e° 
25C 2 ) = 0(n) balls, then we have 

n ) 

Define vpW := min{<&W,n • e 5 / 2 }. Then, since $W is increasing in t, equation (|5.2|) yields 

5CA\ 



E 



< 1 + e CT 



. $(*) 
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and thus inductively, E ] ^ (l -\- e (T ■ ^p-)" • ^ (0 ^ < e 5 / 4 • n. Hence, applying Markov's 
inequality gives Pr < e 5 / 2 • n] > 1 - e" 5 / 4 . By definition of ¥(<*), if < e 5 / 2 • n, then 

^r(a) = $ (a)_ Hence, Pr [$( a ) < e 5 / 2 ■ n] ^ 1 - e" 5 / 4 , as required. If $( a ) < e" 5 / 2 • n occurs, 
then since every vertex has at least one neighbor with load zero, the maximum load after the 
allocation of a balls is 1. Then, we use subadditivity (cf. Lemma 12, 7p to conclude that the 
maximum load after all [~n/af| • a balls are allocated is at most 1 • [n/a] with probability at 
least 1 - \n/a\ ■ o(l) = 1 - o(l). □ 



6 Impact of tie-breaking rules 

Proof of Theorem 11.51 We now analyze the effect of employing different tie-breaking rules. 
We first describe the construction of the graph G, which will be a d-regular graph, where 
d = w(l)asn— t-oo. Additionally, we may assume that d y/n, since otherwise the claimed 
lower bound is trivial. For the construction of G, we assume that there is an integer k such 
that 1 + Ya=o d ■ (d — l) 1 = n. Note that k = 0(logn/logd). Then, let G be a balanced tree 
with root s so that 

\N % S \ =d-{d- If' 1 for any i ^ 1. 

Hence G is a tree where all vertices except for the leaves and the root have d — 1 successors; the 
root has d successors, and the leaves have no successor. Hence, the root has degree d, the inner 
vertices have degree d as well and the leaves have degree 1. Further, note that the number of 
leaves is d ■ (d — l) k ~ 1 . In order to make the graph d-regular, we simply add edges among the 
leaves in G so that, after all edges have been added, every leaf has degree d (this is possible, 
since the number of edges to add is smaller than the total number of leaves). 

We choose a := min {(d— 1) 1//4 , k — 2}. The process of allocating the n balls will be divided 
into a phases and in each phase we consider the allocation of n/a balls. To prove the desired 
lower bound, we focus on the vertices in -B 2a . 

Next, we define an event that essentially shows that there are always enough balls so that 
the tie-breaking rule can send balls towards the root: 

a 2a 

8 := n n { Vu e n » : z » > 2 } . 

p=ie=i 

where Z p (v) := St=fo-i).(n/a)+i ^ = u ) ^ s ^ e num ber of balls born on v in phase p. Hence 
the event E means that for each vertex in N%, there is in each phase at least one neighbor 
in Ng +1 on which two balls are born; hence, for at least one ball we may be able to use the 
tie-breaking rule and forward the ball towards the root s. 

Let us estimate the probability that the event £ occurs. First, for any fixed u E N% and 
v G N u n N e s +1 

r / n i (nla\ 1 / iW Q - 2 i 

Since the events {Z p (v) ^ 2} v& n u , are negatively correlated, we have that, for any fixed u G Ng, 
3veN u D N e s +1 : Z p (v) ^2^1- (l- ^) ^ 1 - exp (~^f) ■ 



Pr 
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Hence, 



a 2a 

pr[£]£i-£ Pr 

p=i £=i ue jvj 

2a 

^ l-a-J^|JV*| -exp 

£=1 



3i> G iV u fl iVf +1 : Z p (u) > 2 



d-1 



^ 1 — exp 



d-1 



5 Or 



a • 2d(d- 1) 



2a-l 



We now claim that the last term in equation (|6,1|) is 1 — o(l). To this end, recall the 

choice of a and d = u)(l). First, since a ^ (d — l) 1 / 4 , exp (— f^jr) ^ exp ( — ^ ^— ), whereas 

a • 2d • (d - l) 2a_1 < 2d 3a < 2d 3 ^ 1 ) 1/4 = 2 • exp (logd • 3(d - 1) 1/4 ). Hence, as d -> oo, the 
probability on the right-hand side in equation (|6.ip is 1 — o(l); i.e., we have shown that 



Pr[£] ^ 1 -o(l). 




A A A A A A 



U UUUJUUUUUUUUUUUU UUMMU 




A A A A A A 
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A A A A A A 




1 juuuuuuuuuuuuuuuuuuuuul 



! I 1 



Figure 2: Illustration of the a = 3 phases after which the root vertex s has a load of at least 3. The red color indicates 
the ball which are placed on the vertex in the recent phase. The existence of a red ball on a vertex, say, u, follows, since 
there exists at least one successor of it, say, v S N u , on which at least two balls are born in that phase. 

It remains to show that if the event £ occurs, then it is possible to break the ties so that we 
end up with a maximum load of a after the allocation of n balls (see Figure [2] for an illustration). 
Our tie-breaking rule follows the simple strategy that, whenever possible, a ball is forwarded 
to a vertex closer to the root s. By Lemma 12.61 we may assume that balls are only generated 
in the set Bf and, in each phase, every vertex in this set generates at most two balls. Then 
after the first phase is completed, every vertex in B^ a contains at least one ball, since for each 
such vertex there is at least one neighbor in Bg a+ on which two balls are born. Moreover, in 
the second phase, every vertex u in Bl a ~ 2 has at least one neighbor v 6 N u in the next level 
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such that: (i) two balls are generated on v in the second phase and (ii) all neighbors of v and 
v itself have at least one ball at the beginning of phase two. Hence if ties are always broken in 
the direction towards the root, every vertex in B^ a ~ 2 will have at least two balls at the end of 
phase two. Completing the induction, we conclude that at the end of phase a, the vertex s will 
have at least a balls. Overall, we conclude that the maximum load is at least a whenever the 
event £ occurs. Since £ occurs with probability 1 — o(l), the proof is complete. □ 



7 Lower bounds for sparse graphs 

Proof of Theorem 11.61 In the first part, we show that the maximum load after n balls are 
allocated is where G is any graph with maximum degree A. Our arguments are 

almost the same as in the proof of the lower bound of Theorem [L3j We are using again the fact 
that with probability at least 1 — there is a vertex u G V on which at least i^°f^ n balls 
are born (cf. [la . Lemma 5.12]). Then, applying Lemma 12.41 with S = {u}, we obtain that the 
maximal load j3 := 1^1 satisfies 



log log n 

Since \Bu \ ^ A' 3 , the above inequality implies that 

l ° gn 



log log n ' 



log I log n 

which in turn implies that /? = O j - log(/3) ) , i.e., /3 = (nfgp) . 

Now, for the second part, we show that there exists a d-regular graph for which the maximum 

load after n balls are allocated is Q ^"gn^ • We first describe the construction of the 

ci-regular graph G. First take n/{d— 1) disjoint cliques of size d— 1 and arrange them in a cycle. 
Then connect two cliques which are next to each other in the cycle by d— 1 vertex-disjoint edges. 
This way we obtain a d-regular graph, which can be also defined as the Cartesian product of a 
cycle of length n/(d — 1) and a clique of size d — 1. 



Figure 3: Illustration of the construction of the graph G, where d — 1 = 4 and n/ (d — 1) = 9, so n = 36. 

Let us now consider the number of balls that are born in each clique. This can be seen as 
the 1-choice process where n balls are randomly placed into n/{d — 1) bins. By Raab and Steger 



171 . Theorem 1, Cases 1 & 2] it follows that, with probability 1 — o(l), there exists a clique with 



vertex set S, \S\ = d — 1, so that the number of balls born in S is at least 

log n 



log(n/(d-l)) 
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for some constant C > 0. Now we use Lemma 12.41 with the &s above to conclude that the 



maximum load /3 = X^lx satisfies 



P-\B's\>C 



log n 



log 



log(n/(d-l)) 
d-1 



Since \B§\ < (2/3 + 1) • d and log(n/(d-l)) = 8(logn), this implies /3 = O ( J - g „ - ) . □ 
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A Standard technical results 

Lemma A.l (Azuma's inequality |2J, Theorem 7.2.1]). Let Xq, Xi, . . . , X m be a martingale such 
that there exists a fixed positive c for which |Xj — ^ c for all i. Then, 



Pr[|X m -X | > A] ^exp 



A 2 



2c 2 



m 



Lemma A. 2 (Azuma's inequality with variance bound 0, Theorem 6.1]). Let X , X u . . . , X m 

be a martingale adapted to the filtration T%. Suppose that there exists a fixed positive c for which 
\Xi — Xi-\\ ^ c for all i and there exists d such that E [ (Xi — Aj_i) 2 | T{-\ ] ^ d for all i. 
Then, 

( A 2 

Pr [ \X m - X \ > A] < exp -— — — - 

\ 2c'm + cA/3 

For the special case where Xq, Xi,..., X m are independent Bernoulli random variables, we 
can apply the above lemma to the random variables (Xi — E [ Xi ])j with d = E [ X\ ] and c = 1 
to obtain the inequality below. 

Lemma A. 3. Let Xi, . . . ,X m be m independent, identically distributed Bernoulli random vari- 
ables. Let X := YliL l Xi. Then, for any A > 0, 



Pr [\X -E[X] | ^ A] < exp 



2E[A] + A/3 



Lemma A. 4. Consider the 1-choice process {X v }„<=y where n balls are allocated to n bins cho- 
sen independently and uniformly at random. Let £q := 8eA 2 and let A^ := 
Then, for any £ with £q ^ £ = o(log 2 n), 



Pr 



|A,| > 4 - (2A)^ + ^ 
4A ' £ 



< 2n 



- log 5 n 



Proof. Fix any £ with £ ^ £q. Let {X v } v ^v be n independent poisson random variables with 
mean 1. Define A £ := ju 6 V: X^ ] > ij. Since {|A*| ^ • (2A)~* + is a monotone 

event in the number of balls 1 ^ n ^ |V|, it follows by a standard "Poissonization argument" 
(see, e.g., [13|, Corollary 5.11]) 



Pr 



. , n / a \ p log 7 n 
|A,| > — ■ (2A)-* + -L 



< 2 • Pr 



n log n 



A,>-.(2Ar + 
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To bound the latter probability, let us first estimate E |A^| . First, if P is a Poisson ran- 
dom variable with parameter 1, then we have the following Chernoff-type inequality (|2j, Theo- 
rem A. 1.15]): for any e > 0, 

Pr[P ^ (1 + e)] ^ e e (l + e)-( 1+£ ). 
As long as e ^ 8eA 2 — 1, we can write 

Pr[P ^ 1 + e] ^ e 1+£ (8eA 2 )-( 1+£ ) (8A 2 )-( 1+e) , 
and hence, replacing 1 + e by i gives 

Pr[P^] ^ (8A 2 )^. 

Now observe that | A^| is stochastically smaller than the sum of n independent Bernoulli random 
variables Z\, . . . , Z n , each with parameter (8A 2 )^^. Hence, if we denote Z := Y27=i then 



Pr 



. ~ . n . A . f log n 
|A^-.(2A)- + ^ 



^ Pr 



n , « % e log 7 n 
Z ^4A- (2A) +J T~ 



Note that B[Z] = n- (8A 2 )"\ Hence by Lemma IA73 



Pr 



n 



log 7 n 



< Pr 



^ exp 



Z > E[Z] + 



log 7 n n 

— 1 (2AV 

I 8A V ; 



/ 



V 



V 2 + & • ( 2A )~ 



2E[Z] + ^ + ^-(2A) 



To bound the last term, we proceed by a case distinction. The first case is when log 7 n/^ ^ 
& • (2A)"f Then also E [Z] < log 7 n/£ and hence 



exp 



( (H 2 + m • ( 2A r 



v 



2E[Z] + i^ + 4-(2A)- 
Otherwise, log 7 n/^ < ^ • (2A) _/ . Then, 







^ exp 




A log 7 n 







< n 



/ 



exp 



v 1 + m • ( 2A )~ 



2E[Z] + ^ + ^-(2A) 



exp 



{& • ( 2A r r 

4-^-(2A)- 



■ log n 



Hence, 



Pr 



n , . , o log 7 n 



s; 2 • ri 



- log 5 n 



as desired. 



□ 
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